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Abstract — We consider the delay properties of max-weight 
opportunistic scheduling in a multi-user ON/OFF wireless system, 
such as a multi-user downlink or uplink. It is well known 
that max-weight scheduling stabilizes the network (and hence 
yields maximum throughput) whenever input rates are inside 
the network capacity region. We show that when arrival and 
channel processes are independent, average delay of the max- 
weight policy is order-optimal, in the sense that it does not 
grow with the number of network links. While recent queue- 
grouping algorithms are known to also yield order-optimal delay, 
this is the first such result for the simpler class of max-weight 
policies. We then consider multi-rate transmission models and 
show that average delay in this case typically does increase with 
the network size due to queues containing a small number of 
"residual" packets. 



I. Introduction 

We consider the delay properties of max-weight opportunis- 
tic scheduling in a multi-user wireless system. Specifically, 
we consider a system with N transmission links. Each link 
receives independent data that arrives randomly and must 
be queued for eventual transmission. Separate queues are 
maintained by each link i € {1, . . . , N}, so that data arriving 
to queue i must be transmitted over link i. The system works 
in slotted time with normalized slots t 6 {0,1,2,...}. The 
channel states of each link vary randomly from slot to slot, 
and every slot t the network controller observes the current 
queue backlogs and the current channel states, and selects a 
single link for wireless transmission. 

This is a classic opportunistic scheduling scenario, where 
the network scheduler can exploit knowledge of the current 
state of the time varying channels. It is well known that max- 
weight scheduling policies are throughput optimal in such 
systems, in the sense that they provably stabilize all queues 
whenever the input rate vector is inside the network capacity 
region. This stability result was first shown by Tassiulas and 
Ephremides in [2] for the special case of ON/OFF channels, 
and was later generalized to multi-rate transmission models 
and systems with power allocation [3] [4| |5] |6|. However, the 
delay properties of max-weight scheduling are less understood. 
An average delay bound that is linear in N is derived in [5| 
0. While this bound is tight in the case of correlated arrival 
and channel processes, it is widely believed to be loose for 
independent arrivals and channels. 
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In this paper, we focus on the special case of ON/OFF 
channels, and show that the max-weight policy indeed yields 
average delay that is 0(1) under independence assumptions. 
Thus, average delay does not grow with the network size and 
hence is order optimal. While our previous queue grouping 
results in [7| also demonstrate that 0(1) delay is possible, 
this is the first such result for the simpler class of max- 
weight policies. Specifically, we first show that for any input 
rate vector that is within a p-scaled version of the capacity 
region (where p represents the network loading and satisfies 
< p < 1), the max-weight rule yields average delay that 
is less than or equal to clog ^l^y . where c is a constant 
that does not depend on p or This is in comparison 
to the previous delay bound of developed for max- 
weight scheduling 0. Note that our new bound does 
not grow with N, but has a worse asymptotic in p. We next 
present a different analysis that improves the delay bound to 
cl °g(i/(i-p)) f or S y S t ems w j m "/-balanced" traffic rates (to 
be made precise in later sections). That is, if arrival rates are 
heterogeneous but are more balanced (so that the difference 
between the maximum arrival rate and the average arrival 
rate is sufficiently small), then order-optimal average delay 
is maintained while the delay asymptotic in p is improved. 

Finally, we consider systems with multi-rate capabilities. We 
first provide a delay bound that grows linearly with N, similar 
to the bounds in but with an improved coefficient. We 
then provide an example multi-rate system and show that its 
average congestion and delay must grow at least linearly with 
N under any scheduling algorithm, due to many queues having 
a small number of "residual" packets. This is an important 
example and demonstrates that the 0(/V) behavior of the 
multi-rate delay bound is fundamental and cannot be avoided, 
highlighting a significant difference between single-rate and 
multi-rate systems. 

It is known that order-optimal delay requires queue-based 
scheduling. Indeed, it is shown in [7] that average delay in an 
N-usei downlink with time varying channels grows at least lin- 
early with N if queue-independent algorithms are used (such 
as round-robin or randomized schedulers). Related results are 
shown for N x N packet switches in [8|, where a delay gap 
between queue-aware and queue-independent algorithms is 
developed. Delay optimal control laws for multi-user wireless 
systems are mostly limited to systems with special symmetric 
structure |2| |9] 1 10] . Delay optimality results are developed 
in IfTTI for a heavy traffic regime in the limit as the system 
loading p approaches 1, Recent results on exponents of the 

1 The value c is used here to easily express a delay scaling relationship, and 
represents a generic coefficient that does not depend on p or N . The value c 
is not necessarily the same in all places it is used. 
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tail of delay distributions are provided in |[T2l |[T3l , and order- 
optimal delay for greedy maximal scheduling with p a constant 
factor away from 1 is considered in lfT4l |fT5l . 

The max-weight rule is also called the Longest Connected 
Queue (LCQ) scheduling rule in the special case of an 
ON/OFF downlink. This policy was developed by Tassiulas 
and Ephremides in [2], where it was shown to support the 
full network capacity region and to also be delay optimal in 
the special symmetric case when all arrival rates and ON/OFF 
probabilities are the same for each link. The fact that the actual 
average delay of LCQ in such symmetric cases is 0(1) was 
recently proven in [10] (which shows that doubling the size 
of a symmetric system does not increase the average delay) 
and [7] (which uses a queue-grouped Lyapunov function 
to bound the average delay). Delay properties of variations 
of LCQ for symmetric Poisson systems are considered in 
llT6l in the limit of asymptotically large N. For asymmetric 
systems, it is shown in [7| that a different algorithm, called 
the Largest Connected Group (LCG) algorithm, yields 0(1) 
average delay. However, the LCG algorithm requires some 
statistical knowledge to set up a queue-group structure. Hence, 
it is important to understand the delay properties of the 
simpler max-weight rule, which does not require statistical 
knowledge. In this paper, we combine the queue grouping 
concepts developed in Q together with two novel Lyapunov 
functions to provide an order-optimal delay analysis of max- 
weight. The first Lyapunov function we use has a weighted 
sum of two different component functions, and is inspired 
by work in ifTTl where a Lyapunov function with a similar 
structure is used in a different context. 

In the next section, we specify the network model and 
review basic concepts concerning the network capacity region. 
Section Hn] proves our first delay result for the ON/OFF chan- 
nel model with general heterogeneous traffic rates inside the 
capacity region. Section [IV] provides our second bound (with 
a tighter asymptotic in p) for the case of heterogeneous traffic 
rates but under an /-balanced traffic assumption. Section 
M treats multi-rate systems. Section [Vl] provides simulation 
results. 

II. System Model 

Consider a multi-user wireless system with N transmission 
links. The system operates in slotted time with normalized 
slots t 6 {0, 1,2,.. .}. We assume that data is measured in 
units of fixed size packets, and let Ai (t) represent the number 
of packets that arrive to link i G {1, . . . , N} during slot t. 
Each link i maintains a separate queue to store this arriving 
data, and we let Qi(t) represent the number of packets waiting 
for transmission over link i. 

Let Si(t) represent the channel state for the zth channel 
during slot t. We assume that Si(t) is a non-negative integer 
that represents the current transmission rate (in units of pack- 
ets/slot) available over channel i if this channel is selected for 
transmission on slot t. For most of this paper, we consider the 
simple case of ON/OFF channels, where Si(t) £ {0,1} for 
all channels i G {1, . . . , N} (multi-rate systems are treated in 
Section 0. Define S(t) = (Si(t), . . . , S N (t)) as the channel 
state vector. 



Let pii{t) represent the control decision variable on slot t, 
given as follows: 

, . _ J Si(t), if channel i is selected on slot t 
\ 0, otherwise 

Define fi(t) — (/ii(i), . . . ,/j,N{t)) as the vector of transmis- 
sion decisions. We also call this the transmission rate vector, 
as it determines the instantaneous transmission rates over each 
link (in units of packets/slot), where the rate is either or 1. 
The constraint that at most one channel is selected per slot 
translates into the constraint that fi(t) has at most one non-zero 
entry (and any non-zero entry i is equal to Si(t)). Define J-(t) 
as the set of all such control vectors fj,(t) that are possible for 
slot t, called the feasibility set for slot t. The queue dynamics 
for each queue i £ {1, . . . , N} are given as follows: 

Qi(t + 1) = max[Qi(t) - ^(t), 0] + A,{t) (1) 

subject to the constraint fi(t) € F(t) for all t. 

A. Traffic and Channel Assumptions 

We assume the arrival processes Ai(t) are independent 
for all i 6 {1, . . . , N}. Further, each process Ai(t) is i.i.d. 
over slots with mean Aj = E{Ai(i)} and with a finite 
second moment E{yli(i) 2 } < oo. Similarly, we assume 
channel processes Si(t) are independent of each other and 
i.i.d. over slots with probabilities Pr[Si(t) = 1] = pi for 

' {i w. 

B. The Network Capacity Region 

Suppose the network control policy chooses a transmission 
rate vector every slot according to a well defined probability 
law, so that the queue states evolve according to ([T). 

Definition 1: A queue Qi(t) is strongly stable if: 

1 

lim sup - E {Qj (t) } < oo 

We say that the network of queues is strongly stable if all 
individual queues are strongly stable. Throughout, we shall 
use the term "stability" to refer to strong stability. 

Define A as the network capacity region, consisting of the 
closure of all arrival rate vectors A = (Ai, . . . , Ajv) for which 
there exists a stabilizing control algorithm. In J2| it is shown 
that the capacity region A is the set of all rate vectors A = 
(Ai, . . . , Aat) such that for each of the 2^ — 1 non-empty link 
subsets C C {1, . . . , N}, we have: 

^2Xi<l-Il ie c(l-Pi) (2) 

This is an explicit description of the capacity region A. The 
following alternative implicit characterization is also useful for 
analysis (see |6| and references therein): 

Theorem 1: (Capacity Region A) The capacity region A 
is equal to the set of all (non-negative) rate vectors A = 
(Ai, . . . , Aat) for which there exists a stationary randomized 
control policy that observes the current channel state vector 
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S(t) and chooses a feasible transmission rate vector fi(t) G 
F{t) as a random function of S(t), such that: 

Xi=E{fj,i(t)} for all i e {1, . . . , iV} (3) 

where the expectation is taken with respect to the random 
channel vector S(t) and the potentially random control action 
that depends on S(t). □ 

It is easy to see that any non-negative rate vector that 
is entrywise less than or equal to a vector A G A is also 
contained in A. This follows immediately from Theorem Q] by 
modifying the stationary randomized policy /z(t) that yields 
E{/Xj(i)} = Aj to a new policy fi(t) by probabilistically 
setting each /ij (t) value to zero with an appropriate probability 
q u yielding E {/*(<)} = E{ W (t)} (1 - ft) < E {/*(<)}. 

It is also easy to show that the capacity region A is 
convex and compact (i.e., closed and bounded). Further, if 
E{Si(t)} > for all i G {l,...,N}, then A has full 
dimension of size N and hence has a non-empty interior. 

C. The Max-Weight Scheduling Policy 

Given a rate vector A interior to the capacity region A, 
a stationary, randomized, queue-independent policy could in 
principle be designed to stabilize the system, although this 
would require full knowledge of the traffic rates and channel 
state probabilities. However, it is well known that the fol- 
lowing queue-aware max-weight policy stabilizes the system 
whenever the rate vector is interior to A, without requiring 
knowledge of the traffic rates or channel statistics [2 |: Each 
slot t, observe current queue backlogs and channel states 
Qi(t) and Si(t) for each link i, and choose to serve the link 
i*{t) G {1, . . . ,N} with the largest Qi(t)Si(t) product. This 
is also called the Longest Connected Queue policy (LCQ) ||2), 
as it serves the queue with the largest backlog among all that 
are currently ON. 

The max-weight policy is very important because of its 
simplicity and its general stability properties. However, a tight 
delay analysis is quite challenging, and prior work provides 
only a loose upper bound on average delay that is 0(N), 
i.e., linear in the network size [5| [6|. It is shown in [7| that 
0(1) average delay is possible when both channels and packet 
arrivals are independent across users and across timeslots, and 
when no traffic rate is larger than the average traffic rate by 
more than a specified amount. The 0(1) delay analysis of 
uses an algorithm called Largest Connected Group that is 
different from the max-weight policy and that requires more 
statistical knowledge to implement. In the following, we use 
the queue grouping analysis techniques of [7| to show that 
the simpler max-weight policy can also provide O(l) average 
delay, and does so for all traffic rates within a p-scaled version 
of the capacity region. However, the scaling in p is worse than 
that in [7|. Section HVlreco vers the same p scaling as [7 | under 
a similar "/-balanced" traffic assumption. 

III. Delay Analysis for Arbitrary Rates in A 

Consider the ON/OFF channel model where each Si (t) is an 
independent i.i.d. Bernoulli process with Pr[Si(t) = 1] = pi. 
Assume the arrival rate vector A = (Ai, . . . , Xn) is interior to 



the capacity region A, so that there exists a value p such that 
< p < 1 and: 

A G pA (4) 

That is, A is contained within a p-scaled version of the capacity 
region. The parameter p can be viewed as the network loading, 
measuring the fraction the rate vector A is away from the 
capacity region boundary. Define A to t(t) as the total packet 
arrivals on slot t: 

N 
i=l 

Define X to t — 52i=i ^» as tne sum P ac ket arrival rate. Because 
the sum of the entries of any rate vector in the capacity region 
A is no more than 1, we have by Q that X to t < P- 

A. Important Parameters of A 

To analyze delay, it is useful to characterize the N- 
dimensional capacity region A in terms of its size on subspaces 
of smaller dimension. To this end, define p m i n as the smallest 
channel ON probability: 

Pmin= min pi 
ie{i,...,N} 

We assume that < p m %n < 1- For each positive integer K, 
define parameters fi a ^ m and tk as follows: 

T K = 1 - (1 - Vrnin) K 

Thus, r/f = Kfi s ^ m . The following lemma shall be useful. 

Lemma 1: For any positive integer K and any probability 
Pmin > 0, we have p s ^ m > That is: 

^[1 - (l-Pmin)"} > ^-[1 - (1 - P ,mn) K+1 ] 

Proof: See Appendix D. ■ 
Further, for K G {1, ... , N}, let Ck represent a particular 
subset of K links within the link set {1, . . . , N}. For each 
subset Ck, define lc K as an Af-dimensional vector that is 1 
in all entries i G Ck, and zero in all other entries. 

Lemma 2: For each set Ck of size K (for any integer K 
such that 1 < K < N) we have: 

1 Ck G A 

Furthermore, for each integer k such that 1 < k < K and for 
any set Ck that contains k links, we have p s ^ m lc k £ A. 

Proof: We first prove that p s ^ m l CK G A. By ©, it suf- 
fices to show that for any integer m such that 1 < m < K, the 
sum of any m non-zero components of ^ m lc K is less than 
or equal to r m @ That is, it suffices to show that mp s ^ m < r rn . 
But this is equivalent to showing that p s ^ m < /i^ m for 
m < K, which is true by Lemma Q] Finally, the fact that 
/^"'l-Cfc G A (for any integer k such that 1 < k < K) 
follows because any rate vector with entries less than or equal 
to another rate vector in A is also in A. ■ 

2 Note that r m < 1 — rijg£ rTi (l — p;), where C rn is any subset of m 
links. 
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Thus, p s ^ m can be intuitively viewed as an edge size such 
that any AT-dimensional hypercube of this edge size (with 
dimensions defined along the orthogonal directions of any K 
axes of M. N ) can fit inside the capacity region A. 

B. The 0(1) Delay Bound for Arbitrary Traffic in A 

Suppose the LCQ algorithm is used together with a sta- 
tionary probabilistic tie breaking rule in cases when multiple 
queues have the same weight. This allows the queueing system 
to be viewed as a stationary Markov chain. In this case, it is 
well known that if the arrival rate vector is interior to the 
capacity region, then all queues are stable under LCQ, with 
a well defined steady state time average |6[. The following 
0(N) delay bound for LCQ is given in J7) 



I 



2r N (l-p) 

where W represents the average delay in the system. The 
bound © also holds for arrival vectors A(t) that are i.i.d. over 
slots but with possibly correlated entries A4 (t) on the same slot 
t. The next theorem demonstrates an improved O(l) bound in 
the case when all arrival processes Ai(t) are independent. 

Theorem 2: (Delay Bound for LCQ) Consider the ON/OFF 
channel model and assume processes Ai{t) and Si(t) are 
independent and i.i.d. over slots. Assume that A G pA for 
some network loading p such that < p < 1. Let K be any 
integer such that vk+i > Xtot, that isQ 



l-(l-Pn 



\K+1 



>\t. 



(6) 



Then the max-weight (LCQ) policy for the ON/OFF channel 
model stabilizes all queues and yields: 



±Q t < KB > C 

i=l 



(1-P) 2 



where Q i is the time average number of packets in queue i, 
and where the constants Bg, C, and 9 are defined: 



1 



N 



N 



: . )Z >{A(t) a }-EA? 

i=l i=l 
+ e -[^{A tot {tf}+\ tot -2X 2 tot ] 



TK+l 



c 



N KX tot | r K( r K+l~ X tot) 



if K < N 



(i-p) 



if K > N 

if K <N 
if K > N 



(7) 
(8) 

(9) 
(10) 



3 The bound in |7) is of the form c/e, where t is any value such that 
A + e £ A, where e is a vector with all values equal to e. The bound {5) 
follows by observing that e = (1 — p)rjv /N satisfies A + e € A whenever 
A £ pA. A similar c/e bound is given in |5J (6) for more general multi-rate 
systems. 

4 Note that \ tot < p < 1, and hence there is always a suitably large value 
if such that (6) holds. 



By Little's Theorem, average delay W thus satisfies: 

KB e C NB 



W < min 



(ID 



_A to t(l - p) 2 ' A tot rAr(l - p) 

where Bq represents the value of Bg with 6 = 0, and the 
second expression in the above min[-, •] function is identical 
to the previous delay bound (0. 

The proof of Theorem [2] is given in the next subsection. 
We note that the right term inside the min[-, •] operator in 
(fTTT l is smaller in the case K > N. The above bound can be 
minimized over all positive integers K that satisfy tk+i > 
Xtot- For a simpler interpretation of the bound that illuminates 
the fact that this is an 0(1) delay result, note that because 
(l+p)/2 > p > Xtot, we can ensure that (O holds by choosing 
K to satisfy: 

l-(l-p mm ) K+1 > (l+p)/2 
Choosing K as follows accomplishes this: 

log(2/(l - p)) 



K 



L 



1 



(12) 



l0g(l/(l -Pmin)) 

Because Xtot < p, it is not difficult to show that with this 
choice of K, we have rx+i — Xtot > (1 — p)/2. Thus, in the 
case K < N we have: 



C < 



TK+l 



< 



'A" 



(1-P) 

Because = 1/rjv for the case K > N, we have that 
O = O(l) (regardless of whether or not if < AT). Further, we 
have from ( Tf2l that AT is proportional to log(l/(l — p)) but 
independent of AT. Finally, if arrival processes are independent 
so that E{A tot (t) 2 } = 0(1), we have Bg/X to t = 0(1). 
Therefore, the delay bound of ( fTTT l has the form: 



W < min 



cilog(l/(l-p)) c 2 N 



(13) 



[i-pY '(l-p)_ 

where c\ and C2 are constants that do not depend on p or 
AT. If A^ itself is small, then the right expression in the 
above min[-, •] can be smaller than the left expression (i.e., 
the previous delay bound (f5]) can be the same as our new 
delay bound in the case when A^ is small). However, if the 
loading p is held fixed as Af is scaled to infinity, then the left 
expression in the min[-, •] is always smaller and demonstrates 
O(l) average delay (see also simulations in Figs. [JJand|2]of 
Section [VTb . Thus, LCQ is order-optimal with respect to N. 
However, the left delay bound has a worse asymptotic in p, 
and so it would be worse than the right bound in the opposite 
case when N is fixed and p is scaled to 1. 

C. Lyapunov Drift Analysis 

To prove Theorem it suffices to consider only the case 
K < N, as the delay bound in the opposite case K > N 
is identical to the previous delay bound (0). Let Q(t) = 
(Qi(t), . . . , Qjv(£)) be the vector of queue backlogs. Define 
Qtot(t) as the sum queue backlog in all queues of the system: 



N 

Qtot(i)=^<3i(t) 
»=i 



(14) 
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Define the following Lyapunov function: 

Lm))=h Eli qm 2 + f (eJLi Qj(t)) 2 d5) 

where 9 is a positive constant to be determined later. Thus, 
L{Q{t)) = | Eti Qtit) 2 + iQtotit) 2 . This Lyapunov func- 
tion uses the standard sum of squares of queue length, and 
adds a new term that is the square of the total queue backlog. 
This new term incorporates the queue grouping concept similar 
to (7|, and will be important in obtaining tight delay bounds. 
The technique of composing this Lyapunov function as a sum 
of two different quadratic terms weighted by a 9 constant 
shall be useful in analyzing both stability and delay in two 
different modes of network operation, and is inspired by a 
similar technique used in ifTTl to analyze stability in a very 
different context. Specifically, work in IfTTl considers multi- 
hop networks with greedy maximal scheduling and achieves 
stability results when input rates are a constant factor (such as 
a factor of 2) away from the capacity region boundary. 

Here, we consider a single-hop network with time-varying 
channels, and obtain both stability and order-optimal delay 
results for all input rates inside the capacity region. The 
intuition on why this 2-part Lyapunov function allows a tight 
delay bound is as follows: The first term is a standard sum of 
squares of queue length, and ensures stability of the algorithm 
while creating a large negative drift when the number of non- 
empty queues is small. However, this term also has a relatively 
small negative drift when the number of non-empty queues is 
large, preventing 0(1) delay analysis from this term alone. 
To compensate, the second term is a square of the sum of 
all queues, which does not significantly affect the drift of the 
first term when the number of non-empty queues is small, but 
creates a large negative drift to help the first term when the 
number of non-empty queues is large. 

The queue dynamics (HJ can be rewritten as follows: 

Qi{t + l) = Q i {t)-p i {t)+A i {t) (16) 

where p^t) = min[Qj(i), pi(t)]. Define p, to t(t) = 
Y^i—i P>i(t)> being either or 1, and being 1 if and only if the 
system serves a packet on slot t. The dynamics for Qtot{t) 
are given by: 

Qtot{t + 1) = Qtot(t) - Ptot(t) + A tot {t) (17) 

where A to t{t) = Y^Li Ai(t). Let Q{t) be the stochastic 
queue evolution process for a given control policy. Define the 
one-step conditional Lyapunov drift as follows|f| 

A(Q(t))4E {L(Q(t + 1)) - L(Q(t)) | Q(t)} (18) 

Lemma 3: The Lyapunov drift A(Q(t)) for the ON/OFF 
channel model satisfies: 

A(Q(t)) = E{B(t)\Q(t)} 

N 

-j2Qim{vi(t)-*i\Q{t)} 

i=i 

-9Q tot (t)E{p tot (t) - Xtot\ Q(t)} 

5 Strictly speaking, correct notation should be A(Q(t),t), as the drift 
could be from a non-stationary policy, although we use the simpler notation 
A(Q(t)) as formal notation for the right hand side of )18t . 



where pi(t) and ptot(t) correspond to the LCQ policy, and 
where B(t) is given by: 

m a ^ + ^J2[Mt) 2 -2A i (t)p i (t)) 

Q 

+ 2 [Atot(t) 2 + Ptot(t) - 2p tot (t)A tot (t)} (19) 

Proof: (Lemma [3]) See Appendix A. ■ 
Now note that the LCQ algorithm chooses /j,(t) S T(t) on 
each slot t to maximize Ei=i Qi an d hence: 

N N 
i=l i=l 

where pi*(t) = (p*(t), . . . , p%(t)) is any other feasible 
transmission rate vector in J-(t). It follows that the above 
inequality is preserved when taking conditional expectations 
given the current Q(t) value. Plugging this result into the 
second term on the right hand side of the drift expression 
in Lemma [3] thus yields: 

A(Q(i)) < E {£?(*) | Q{t)} 

N 

-^ft(()EK(()-A,| Q(t)} 

i=l 

~9Q tot {t)E{fi tot (t) - X tot | Q(t)} (20) 

where fi*(t) = (/j,l(t), . . . , p%(t)) is any other feasible con- 
trol action on slot t. Note that ptot(t) in the above expression 
still corresponds to the LCQ policy. 

Let L(t) represent the number of non-empty queues on slot 
t, so that < L(t) < N. 

, Case 1 (L(t) < K): Suppose L(t) < K, and let C(t) 
represent the set of non-empty queue indices. Recall 
that p s ^ m ic(t) £ A (by Lemma |2|i and that \/p e A 
(by assumption that A G pA). By taking a convex 
combination of these two vectors and using convexity 
of the set A, it follows that: 

A + (1 — p)pf m l C (t) G A (21) 

Now let pi*(t) be the stationary randomized policy that 
makes decisions based only on the current channel state, 
and that yields: 

EK(t)}=A + (l-^7'l £(i) 

Such a policy exists by ( f2Tb and Theorem Q] Thus, for 
all i £ C{t) we have: 

EK(i)} = U(l-« m (22) 

Using (l22l in the drift inequality ( f20b and noting that 

Qi(t) = if i i C(t) yields: 

N 

A(Q(t)) < E{B(t)\Q(t)}-J2Q l (m-p)^ m 

+0Q t ot(t)X lot 
= E{B(t)\Q(t)} 

-Q tot {t)[{\ - P )^ m -extot] 
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Define e as follows: 

e4[(l - p)^ m - 9X tot ] (23) 

It follows that: 

A(Q(t)) < E {B(t) | Q(t)} - eQ tot {t) (24) 

• Case 2 (£(t) > AT): Suppose Lit) > K, and again let 
C(t) represent the set of non-empty queue indices. Note 
that X/p G A and p s ^ m l G A, where 1 is the all 1 vector. 
By convexity of A, the convex combination is also in A: 

A + (l-p)^ m lGA 

Now let fi*(t) be the stationary randomized policy that 
makes decisions independent of queue backlog, and that 
yields for all i G {1, . . . , N}: 

E{^(t)}=X i + (l-p)^ m (25) 

Such a policy exists by Theorem [T] Note that when the 
number of non-empty queues is greater than K, there is 
a packet departure under the LCQ policy with probability 
at least one minus the product of the K + 1 largest OFF 
probabilities: 

E {hot I Q(t)} > 1 - n ie£ K+i (1 - Pi ) > r K +i (26) 

where Ck+i represents the set of K + 1 links with the 
smallest success probabilities. Plugging d25b and d26*l > into 
the drift inequality (f20b yields: 

A(Q(t)) < E{B(t)\Q(t)} 

-Qtot{t)[(l - p)/i?T + 0(r K +i ~ Xtot)} 

To equalize the drift in both Case 1 and Case 2, we choose 
9 to satisfy: 

e = (1 - p)p^ m + 9{r K +i - Xtot) 
Thus (using ([231): 

e = (i- P )(pT l -^ m ) 

TK+l 

(! ~ P) [VN m ^tot + ^K m { r K+l ~ Xtot)} 

Recall that we have assumed K < N (as Theorem [2] is 
trivially true if K > N, as described at the beginning of this 
subsection). Thus, we have p s ^ m > pf^ m (by Lemma Q]), and 
so we indeed have 9 > 0. Further, because rjf+i > Xtot, we 
have that e > 0. Therefore, the drift inequality d24l holds in 
both Case 1 and Case 2 (and hence holds for all t and all 
Q(t)). We now use the following well known Lyapunov drift 
lemma (see, for example, for a proof): 

Lemma 4: (Lyapunov Drift 0) If the drift A(Q(t)) of a 
non-negative Lyapunov function satisfies the following for all 
t and all Q(t): 

A(Q(t)) <E{B(t) | Q(t)} - eE{f(t) \ Q(t)} 

for some stochastic processes B(t), f(t), and some constant 
e > 0, then: 

7<B/e 



where 

/ = limsup-^E{/(r)} 




Using this Lyapunov drift lemma in ( 1241 (using fit) — 
Qtot(t)) yields: __ 

Qtot < B/e 

We note that because the system evolves according to a 
Markov chain with a countably infinite state space, the 
time averages are well defined (so that the limsup can be 
replaced by a regular limit). Further, using the fact that 
lim^oo j Y^t=o ^ {Ai( r )} = tr, e value of B can be seen 
to equal the value Be defined in 0, proving Theorem [2] 

IV. A Tighter Bound for "/-Balanced Traffic" 

Here we present a tighter bound on average backlog and 
delay of the LCQ algorithm for the ON/OFF channel model. 
Our bound in this section is of the form clog(yA_)/(l — p), 
which is still O(l) with respect to the network size N, but 
yields a better asymptotic in p. Unfortunately, our analysis 
does not hold for all rate vectors A inside the capacity 
region A. Rather, we make the following assumption about 
a more "balanced" traffic rate vector. Let A = (Ax, . . . , Ajy), 
and without loss of generality assume that A,; > for all 
i G {1, . . . , N} (else, we can redefine N to be the number 
of links with non-zero rates). Define Xtot = YliLi ^» an( ^ 
Xav = Xtot /N. We say that A has f -balanced rates if there is 
a constant / such that: 

Ai<A „ + / for all i G {1, . . . , N} (27) 

That is, A is /-balanced if no individual traffic rate is more 
than an amount / above the average rate X av . Clearly any 
uniform traffic rate vector is /-balanced for f — 0. However, 
this definition of /-balanced rates also captures a large class 
of heterogeneous arrival rate vectors. We shall prove our delay 
results under the assumption that / is suitably small. A similar 
assumption is used in [7|, and our delay analysis relies heavily 
on the queue-grouping techniques used there. 

A. The Queue-Grouped Lyapunov Function 

Fix an integer K such that 1 < K < N. Define N as the 
smallest multiple of K that is larger than or equal to N: 

N=\N/K\K (28) 

Now define a new rate vector A = (Ai, . . . , Ajv, 0,0,..., 0), 
where the last N — N entries are zero. Define N — N 
"fictitious" queues for these last dimensions (these queues 
always have zero backlog, but shall be convenient to define 
for counting purposes). Define Qk as the set of all possible 
partitions of the link set {1, ... , N} into K disjoint sets, each 
with an equal size of N/K links. Let g G Qk denote a 
particular partition, and define df' , . . . , Cjjr as the collection 
of sets corresponding to g (so that the union U^L^l is equal 
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to {1, .. . ,N}, and the intersection n Cm is empty for 
all m ^ n, where m, n £ {1, . . . , K}). 

For a particular partition g, define Q k 9 \t) as the sum of all 
queue backlogs in the fcth set of g: 

Qk\t)= E &(*) 

Define the following queue-grouped Lyapunov function: 

L(Q(m\ E Ewi 5) w) 2 (29) 

k=i 

This is similar to the Lyapunov function of Q, with the 
exception that it sums over all possible partitions into K 
disjoint groups. For intuition, we note that the /-balanced traf- 
fic assumption allows the "Largest Connected Group" (LCG) 
argument of to proceed on any set of K disjoint groups. 
However, once we fix a particular group, minimizing the drift 
gives rise to the LCG algorithm rather than the "max-weight" 
LCQ algorithm. Changing the Lyapunov function by summing 
over all possible K disjoint groups yields a similar negative 
drift as in LCG, but the "symmetry" induced by summing over 
all groups remarkably makes the drift minimizing algorithm 
the LCQ algorithm (rather than LCG). 

Define A {9) (t) and p i9) (t) as the sum arrivals and depar- 
tures from the fcth group of the partition g: 



4 9 \t) = 



4 9) w 



E Mt) 

sec<«> 

E 



The dynamics for the fcth group of partition g thus satisfy: 



Qi 9 \t + l) = Q[ 9 >(t)-^>(t) + A)°>(t) 



rXa), 



i (9)/ 



(30) 



Define the Lyapunov drift A(Q(t)) as before (given in (fTSIl). 

Lemma 5: For a general scheduling policy, the Lyapunov 
drift satisfies: 



(t)E{^\t)-X^\Q(t)} 



A(Q(t)) = E{C(t)\Q(t)} 

- E E^ ; 

gS0K k=l 

where 9 = 5Z i£jC (») Aj, and where C(t) is defined: 

C ^=\ E E[4 9) w+4 ff) (*) 2 -24 9) w4 s) w 

q£Gk fc=l 

(31) 

Proof: The proof is similar to the proof of Lemma [3] 
Specifically, note from ( f30l > that: 



,(y) 



(^ + 1) 2 



/#>(*)+ 4?" (*) 



GO, 



-2A^(*)4 9) (*) - 2gi 9) w[a1 9) w - 4 9) (*)] 

where we have used the fact that /Etr^ (t) 2 = pf?' (t). The result 
follows by summing over all fc and all groups, and taking 
conditional expectations. ■ 



Remarkably, we next show that the "max-weight" LCQ 
algorithm for this ON/OFF channel model minimizes the final 
term in the right hand side of the above drift expression. 

Lemma 6: (Max Weight Matching) Every slot t, the LCQ 
algorithm chooses a transmission rate vector fj,(t) € J-(t) 
that maximizes the following expression over all alternative 
feasible transmission rate vectors: 

E E«i ff) (*)4 9) (*) 

Proof: See Appendix B. ■ 
It follows that we can replace the variables p~ 9 \t) in 
the final term of the drift expression in Lemma [5] which 
correspond to the LCQ policy, with variables /I^ *(t) that 
correspond to any other feasible rate vector fi*(t) E J~{t), 
while creating an inequality relationship: 

A(Q(t)) < E{C(t)\Q(t)} 

- E EQi 9) WE{/ii 9) *(t)-Ai 9) |QW}(32) 
g ^g K k=l 

The drift inequality (l32l is quite subtle: It is defined in terms 
of any other single feasible rate vector fi* (t) (where this vector 
does not depend on the partition g). Note that the variables 
p^*(t) are defined for different partitions g £ Qk, but for 
each particular g these variables are still derived from the same 
vector fi*(t). They are derived from fi*(t) by summing the 
components of this rate vector that have non-empty queues 
over the dimensions that correspond to the groups within the 
particular partition g. 

B. Optimizing the Drift Bound 

Here we manipulate the sum in the right-hand side of ( |32| > 
to yield a useful drift bound. 

Lemma 7: For any vector A = (Ai, . . . , A^), if there is a 
value (3 such that < < 1 such that for all i £ {1, ... , N} 
we have: 

N K 

then: 

K 



(33) 



[Xtot + gg(l - P)} 
K 



E E«n*) A * ^ Qtot{t)\Q K \ 

g&Gic k=l 

where \Gk\ is the cardinality of Qk, Qtotif) is the total sum 
backlog (defined in (O), and z is defined: 

zA(l - 1/20/(1 - UN) (34) 

Proof: The proof of Lemma |7] follows from simple 
counting arguments, and is given in Appendix C. ■ 

Note that A to t/N < \ to t/N with approximate equality 
when N is large (so that N/N rj 1). The constraints (|33T > 
imply that A is /-balanced with / = f3(l — p)/K. 

Lemma 8: There exists a single randomized strategy that 
observes queue backlogs and channel states for slot t and 
chooses fi*(t) £ Tit) such that: 

K 

E E^i 9) (*) E {4 9) *(*) i Q(t)} > Qtotit) \q k \ r -§ 

g&QK fc=l 
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where r K = 1 - (1 - p min ) K . 

Proof: See Appendix C. ■ 
Using Lemmas [7] and [8] in the drift inequality d32i > yields: 
Lemma 9: If A e pA (for < p < 1) and if (l33l l is satisfied 

for all i G {1, ... ,7V}, then: 

A(Q(i)) < E{C(t)\Q(t)} 

-Q tot (t)\g K \ [rK - Xtot -^ {1 - p)] (35) 

Lemma [9] leads immediately to the delay theorem stated in 
the next subsection. 



C. An Improved Delay Bound for f -Balanced Traffic 

Theorem 3: (Delay Bound for ON/OFF Channels with /- 
Balanced Traffic) Suppose A G pA for < p < 1. Let K be 
the smallest integer that satisfies tk > (1 + p)/2, that is: 

lA'i 



[i-(i-p m ™n > (i + p)/2 



(36) 



Suppose that K < N, and the /-balanced traffic constraints 
d33l are satisfied for some value /3 such that < (3 < l/(2z), 
where zk(l -1/K)/(1 -l/N) (note that z < 1). If the max- 
weight (LCQ) policy is used on this ON/OFF channel model, 
then average queue occupancy satisfies: 



Qt 



< 



KD 



(l-p)(i- z(3) 



W < 



K(D/Xtot) 

(i-p)a-zp) 



(37) 



where D is defined: 
1 



Dk- [A tot +E{^ ot (t) 2 }] 

Further, in the special case when N is a multiple of K, and 
when traffic is uniform and Poisson with A; = Atot /N for all 
i, we have (3 = andf] 



/ i 2KX t 



< 



\ot\ 



W< 2K 



Xi 



1-p - 1-p 

Note that the constraint ( f36t is satisfied by: 



K = 



l0g(l/(l -Pmin)) 



Therefore, K is independent of N, and is proportional to 
]°e(j~~)- Assuming that traffic streams are independent, so 
that E{A tot (t) 2 } = 0(1), implies that D = 0(1). Thus the 
delay bound gives W < c '^CVt 1- ^)) (where c is a constant 
independent of p and iV), being independent of the network 
size N and having an asymptotic in p that is better than that 
of Theorem |2] 

Proof: (Theorem [3} Because A G pA, we have Atot < P 
(as the maximum sum rate is at most rjy < 1). The assumption 
on tk in d36t thus implies: 

[r K ~ hot ~ 2/3(1 - p)] > (1 -p){\ ~ zl3) 

6 These bounds for symmetric Poisson traffic are obtained from the last line 
of the proof of Theorem [5] which gives a slightly smaller bound than that 
achieved by plugging f3 = 0, E {A to t(t) 2 } = \ tot + Af oi into W\ . 



The above value is strictly positive because z(3 < 1/2. Using 
the drift inequality ( f35l > directly in the Lyapunov Drift Lemma 
(Lemma HI yields: 



< 



KC 



\g K \(i- p )a-z0) 



Using the definition of C (t) in OTI ) and the fact that the system 
is stable (so the long term departure rate is equal to Atot) 
yields: 



C 



\Qk\\. 



K 



< \Qk\ 



Atot 
~2~ 



E {A„(<n _ IBkID 



k ) 



The above bound on C proves the first part of the theorem. The 
second part, for uniform Poisson traffic, follows by the above 
equality for C (without the bound), using E jvljf ' (t)\ = 



and E 



(4 S) (*) 2 } 



IT 5 " 



K 



for all g, k. 



V. Multi-Rate Transmission Models 

Now suppose that for each channel i G {1, . . . , N}, the 
states Si(t) are non-negative integers bounded by a finite 
integer pi :in ax, where /i^ mal represents the maximum trans- 
mission rate over channel That is, we have: 

Si(t) G {0, 1, . . . , A^ mo J for all t and all i G {1, . . . , N} 

We assume that (ii tma x > for all i. The queueing dynamics 
are governed by (dJ. The capacity region A is known to be 
equal to the set of all rate vectors that can be achieved via 
a stationary, randomized, queue-independent algorithm that 
chooses fi*(t) as a potentially random function of only the 
current S(t) vector 

The max-weight algorithm in this case is the algorithm 
that observes queue backlogs and channel states every slot 
and selects the link i G {1, . . . , A^} with the largest value of 
Qi(t)Si(t) (breaking ties arbitrarily). Suppose the arrival rate 
vector satisfies A G pA for some loading value p such that 
< p < 1. The analysis in (5| uses a standard Lyapunov 
function, given by the sum of the squares of queue backlog, 
to show the max-weight algorithm for a general downlink has 
average delay upper bounded by cN/(l — p), where c is a 
constant that is independent of N and p. We first present 
a modified version of that prior bound, which has the same 
structure but uses our particular pi :ma x notation and improves 
the c coefficient: 

Lemma 10: Suppose A(t) is i.i.d. over slots with 
E{A(i)} = A, and that the channel state vector S(t) = 
(Si(t), . . . , 5jv(*)) is also i.i.d. over slots. Suppose that A G 
pA for some value p that satisfies < p < 1. Then the system 

7 For consistency, we continue to work in integer units of packets. The 
analysis does not significantly change if Si(t) values are viewed as non- 
negative real numbers with units of bits/slot. 
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is stable under the max-weight algorithm and has an average 
delay bound given by: 



N 



W 



2\ 



(1 - p)fJ> syr , 



N min 



2^,i=l \ tot ) \ tot 



(1 ~ P)Hs 



(38) 



where S 2 is defined: 



S* 2 4E <j max Si(t) 2 
\ie{i,...,N} 



and where fi sym is defined as the largest value such that 

(Psym/N, H S y rn /N, . . . , pL sym /N) G A. 

Further, in the case when all processes Si (t) are independent 
and satisfy Pr[Si(t) = p,i ;max ] > 0, we can bound p sym as 
follows: 

Psym > A(l - (1 - Pmin) N ) 

where p and p m i„ are defined: 

pi = min ^i,, mra 
ie{i,...,iV} 



Pn 



ie{X,...,N} 



Pr[Si(t) > A] 



Proof: The proof uses the Lyapunov function L(Q(t)) = 
^^2iLiQi(t) 2 , as in J6), but provides a simple modi- 
fication of the argument to yield a tighter bound (given in 
Appendix E for completeness). ■ 
We note that the above delay bound holds also in the case 
when Hi.max — oo for some values i, but when the second 
moment of transmission rates is finite (so that S 2 is finite). 
The above delay bound has the structure cN / (1— p), and holds 
even if arrival and channel vectors A[t) and S{t) have entries 
that are correlated over the different links i € {1,...,JV}. 
A similar argument can be used to show stability with the 
same structural delay bound cN/(l—p) for the modified max- 
weight policy that chooses the link i G {1, . . . , A^} with the 
largest Qi(t) min[Qj(i), Si(t)) value. This modified policy can 
sometimes provide smaller empirical average delay than the 
original max-weight policy, although its resulting analytical 
delay bound has a slightly worse coefficient c > c (this 
modified policy is equivalent to the original max-weight policy 
in the case of ON/OFF channels with pi, ma x = 1 for all 
i). Similar to the ON/OFF case, one might suspect that for 
this multi-rate system, average delay that is independent of 
N can be achieved when arrival and channel processes are 
independent over each channel. However, the next subsection 
presents an important example that shows this is not the caseQ 

8 We note that our original pre-print of this paper in 1 18 1 incorrectly claimed 
that multi-rate systems also have delay that is independent of N . The mistake 
in 1181 arose when plugging the equation from Lemma 10 of that paper into 
equation (33) of that paper. Plugging one equation into the other implicitly 
assumed that the sum queue backlog in queues with at least fi m ax packets 
is the same as the total queue backlog in the system. This is true when 
(%„ = 1, but is not true in general as it neglects the "residual" packets in 
queues with fewer than Umax packets. 



A. An example showing necessity of O(N) delay 

Here we present an example showing that the average 
number of queues that have at least one packet but fewer 
than fii.max packets must be linear in JV", which necessarily 
makes the average delay of any scheduling policy grow at least 
linearly with N. Consider a system with N queues with sym- 
metric channels and traffic. Assume that A^ > 3 and suppose 
that all arrival processes A\ (t) are independent and Bernoulli 
with Pr[Ai(t) = 1] = 3/N for all i G {1, . . . , N} (so that 
\i = 3/N for all i, and X to t — 3 packets/slot). Now suppose 
that all channels have pi :Tn ax — 5, and channel state processes 
are i.i.d. with Pr[S t {t) = 5] = 1/2, Pr[S t {t) = 0] = 1/2 for 
all i G {1, . . . , A^}. The largest symmetric rate in the capacity 
region of this system is p sym /N — 5(1 — (1/2) N )/N, and 
hence the arrival rate vector is inside the capacity region and 
has p given by: 

3 

p ~5(i-(i/2n 

Note that p < 1 for A^ > 3, and p is approximately 3/5 
for large N. Here we show that under any scheduling policy, 
in steady state the average number of non-empty queues in 
this system must be linear in N. Specifically, consider any 
scheduling policy, and let Z(t) represent the number of non- 
empty queues on slot t. For simplicity, we assume that Z(t) 
has a well defined steady state under the scheduling policy. 
The intuition behind our proof is that Z(t+ 1) is formed from 
Z(t) by adding the number of new non-empty queues created 
and subtracting any non-empty queue that becomes empty. 
The number of non-empty queues subracted can be at most 
1 (as we can serve at most one channel per slot), while the 
average number of new non-empty queues added is more than 
one whenever Z(t) < N/2. 

Lemma 11: Consider any scheduling policy for which Z(t) 
has a well defined steady state distribution. Then for the system 
above (with Xi = 3/N and pi.max — 5 for all i G {l,...,iV}) 
we have that in steady state: 

Pr[Z{t) > N/2] > 1/3 

and hence E{Z(<)} > N/6. That is, the average number 
of non-empty queues is at least N/6, and hence the average 
number of packets in the system is at least N/6. 

Proof: Define A(t)=Z(t + 1) - Z(t) as the change in 
Z(t) from one slot to the next. Let t be a time at which the 
system is in steady state. We thus have E {A(t)} = 0. On the 
other hand, we have the following: 



E {A(t) | Z(t) > N/2} 
E{A{t) | Z{t) < N/2} 



> 
> 



-1 

XiN/2 



1 



(39) 
1/2 (40) 



where ( 1391 follows because the drift cannot be less than — 1 
on any slot (as at most one non-empty queue can become an 
empty queue), and (|40| > holds because, given that Z(t) < N/2, 
the average number of new non-empty queues that are created 
on slot t is equal to the average number of new arrivals to the 
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empty queues, which is at least Xi(N/2), It follows that: 

= E{A(t)} 

= E {A(t) | Z(t) > N/2} Pr[Z{t) > N/2] 

+E{A(<) | Z(t) < N/2} (1 - Pr[Z{t) > N/2]) 
> (-l)Pr[Z(t) > N/2] 

+(l/2)(l-Pr[Z(t)>N/2}) 

Therefore Pr[Z(t) > N/2] > 1/3, completing the proof. ■ 

VI. Simulations 

Here we present simulations for the ON/OFF system with 
independent channel and arrival processes. We assume that 
Pr[S t (t) = ON] = 1/2 for all i G {1,...,N}. We first 
consider symmetric Bernoulli arrivals, so that A,; = A for all i, 
where A is chosen so that A G pA with p = 0.8. We simulated 
the system over 10 6 slots for values of N between 3 and 300. 
The resulting simulated queue averages are shown in Fig. Q] 
together with the two 0(1) bounds and the previous O(N) 
bound. Note that the previous O(N) bound is a considerable 
overestimate of queue backlog. Our new bounds do not grow 
with N, and our second 0(1) bound (derived for /-balanced 
traffic rates) is indeed tighter than the first O(l) bound (for 
N > 9), although it applies only to /-balanced traffic while 
the first bound applies to any traffic rates in pA. However, 
there is still a significant gap (roughly a factor of 10 in this 
example) between our tightest bound and the simulated value. 
We next consider heterogeneous traffic rates implemented on 
the same ON/OFF system. We assume that N is odd, and 
choose rates A; given as follows: 

f A forie{l,...,(JV-l)/2} 
Aj = < 2A forie{(iV-l)/2 + l,...,JV-l} 
[ 4A for i = N 

where A is chosen so that A G pA for p — 0.8. The results 
are shown in Fig. |2] Note that we plot only the first 0(1) 
bound (for heterogeneous traffic) in this case, although the /- 
balanced traffic assumption also applies in this case when N 
is sufficiently large. 

Simulations of the multi-rate system example in Section 
IV-AI were also conducted, and it was verified that average 
backlog indeed grows linearly with N due to the "residual" 
packets in queues i that have fewer than p.i. ma x packets (sim- 
ulation plots omitted for brevity). However, it was observed 
in the simulations that the total backlog due to queues with 
at least fJ.i, ma x packets is O(l). This suggests that, although 
the total average backlog in multi-rate systems may have a 
fundamental O(N) term due to residual packets, the average 
backlog may be O(l) after a term of at most 5Z i=1 (fii,max — 1) 
is subtracted out. 

VII. Conclusions 

We have presented an improved delay analysis for the max- 
weight scheduling algorithm. For ON/OFF channels, max- 
weight is equivalent to Longest Connected Queue (LCQ), and 
yields average delay that is order-optimal, being independent 
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Fig. 1. Simulation and Bounds for the ON/OFF system with symmetric 
traffic and p = 0.8. 
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2000 1 1 1 1 1 1 1 1 , 




50 100 150 200 250 300 350 400 450 500 

N 

Fig. 2. Simulation and Bounds for the ON/OFF system with heterogeneous 
traffic and p = 0.8. 



of the network size N. If an /-balanced traffic assumption 
holds, average delay was shown to maintain independence of 
N while allowing an improved asymptotic in p. For multi- 
rate channels, a delay bound of O(N) applies. Conversely, 
it is shown for a simple multi-rate example that, unlike 
ON/OFF channels, average backlog must be at least linear 
in N due to "residual" packets. Our delay analysis makes use 
of the technique of queue grouping. The particular Lyapunov 
functions introduced for this delay analysis are powerful and 
may be useful in other contexts. 



Appendix A — Proof of Lemma[3] 

Here we prove Lemma[3] Define Ai(Q(i)) and A2 {Q(t)) 
as the conditional drift for the sum of squares term and the 
square of queue backlog term, respectively, so that A(Q(t)) = 
Ai(Q(t)) + 6A 2 (Q(t)). Squaring (O and using the fact that 

p^t) 2 = jAi(t) and Qi{t)pi{t) = Q i (t)p l (t) (because pi(t) G 
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{0, 1}, and fj,i(t) = (j,i(t) if Q t (t) > 0): yields: 

-Qi(t)(fu(t) -M*)) 
= \ [Q l (t) 2 + Mtf + ik(t)] - 4i (*)&(*) 
-Qi(*)(w(*) -^i(O) 

Therefore: 

Ai(Q(t)) = E{Bi(t)|Q(t)} 

AT 

-5^Qi(t)E{A*i(t)- Ai| Q(t)} 

where 



i 



Similarly: 
1 



t(t + iy 



[Ai{t) 2 +Ai(*)] - 4 



W 2 + A tot (i) 2 

-// tot (t)y4 tot (t) 
-Qtot(t)(£tot(*)->Wt)) 



Therefore: 



A 2 (Q(t)) = E{B 2 (t)\Q(t)} 

-Qtot(t)E{fitot(t) - X tot \ Q(t)} 

where p,tot(t) 2 = fitot{t) (because it is either or 1), and 
B 2 (t)4- [&ot(t) + A tot (t) 2 ] - fitot{t)A tot {t) 

Summing A x (Q(t)) and 9A 2 (Q(t)) and noting from (O that 
B(t) = £fi(i) + 6»B 2 (t) yields the result of Lemma |3] 

Appendix B — Proof of Lemma |6] 

Define the integer to — N/K. Here we prove Lemma 
[6] Given a particular queue backlog vector Q(t), the LCQ 

algorithm maximizes the expression 53i=i Qi(t)fM,(t) over all 
fi(t) G F{t). We now show that this also maximizes the 
expression given in Lemma [6] To this end, we have: 



A" 



A" 



E E^Ww = E EQi 9) w E &(*) 

geSx fc=l g&GK fe=l 



A 



EAi(*)ft(*)lfel 



;.=i 



|gAl(m-l) 
N- 1 



where the final equality holds because link i is in every group 
that multiplies the fii(t) term, and all other links multiply this 
term the same number of times (by group symmetry). The 
above also uses the fact that (by symmetry) the number of 
group partitions for which a particular link j is in the same 



group as link i is equal to the total number of partitions 
multiplied by the probability that a randomly chosen parti- 
tion includes i and j in the same group. Define the above 
expression as f(t) for simplicity. Therefore: 



N _ 1 

fit) = Y,mQi{t)\GK\{i-j—) 

i—l 

N I N \ 

+X>w (£ Qj ' (t) ) |fe| in (41) 

The fti(t) values in the expression for f(t) are the only ones 
affected by the control action on slot t. The final term on 
the right hand side is given by . fa (t) (the total departures 
on slot t) multiplied by a non-negative constant. This final 
term is maximized by any work conserving policy that always 
transmits a packet when there is a non-empty connected 
queue. The first term on the right hand side is a non-negative 
constant multiplied by the term ^\ fa(t)Qi(t)- B ut note mat 
fa(t)Qi(t) = fii(t)Qi(t), and thus the LCQ policy maximizes 
this first term. As LCQ is work conserving, it also maximizes 
the second term, and thus maximizes /(£), proving Lemma [6] 

Appendix C — Proof of Lemmas|7]and[8] 

Proof: (Lemma |7]i Define the integer to — N/K. Using 
a counting argument similar to that of Appendix B (compare 
with (HTT i). we have that for any vector A = (Ai, . . . , A^): 



K 



N 



EE^W 

g eg K k=i 



E^(*)A;M(l-?Mr) 



i=l 



N — 1 



Tfh 1 

-rHtot (t)Xtot\g K \^— (42) 



N — 1 



Using the bound on A, given in ( 1331 ) yields: 



A 



EEsi ff W< 

9&Qk k=l 



Qtot(t)Xtot \Gk 



1 



to — 1 



m — 1 



N N(N-l) N-l 
+Q tot (t)\g K \(l-^—±)P(l-p)/K 

The result of Lemma [7] follows by using the identities: 



1 



TO — 1 



TO — 1 



N N(N-l) N-l 

TO — 1 



1 

K 



(43) 



1 



N - 1 



Proof: (Lemma [HJ Let L(t) represent the number of non- 
empty queues on slot t. If L(t) — 0, then Qtot(t) — and 
the result is trivial. Now suppose that L(t) — I, where I G 
{1, 2, . . . , N}. Define (h,...,l^) to be a 0/1 vector with 
li = 1 if and only if Qi(t) > 0. Define to be the number of 
non-empty queues in the fcth group of partition g. Consider the 
following randomized policy for £t*(t) G J~(t): First observe 
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all channel states Si(t) for non-empty queues i, and define 
new channel states §i(t) as follows: If Si(t) = (OFF), assign 
Si(i) = 0. If Si(i) = 1 (ON), independently assign Si(t) = 1 
with probability p m in/Pi (this is a valid probability because 
Pmin < Pi)- It follows that the new channel state vector S{t) 
has independent and symmetric ON probabilities p m in- Now 
independently, randomly, and uniformly choose a queue to 
serve over all non-empty queues i with Si(t) = 1. It follows 
that for all non-empty queues i we have: 



E{#(f)|Q(*)} = 



1 - (1 -PminY 
I 



n 
i 



Further, for any g € Qk and any k E {!,..., K} we have: 



e{4 9) *wiqw}= E ®{mt)\Q(t)} 



Using this equality gives: 

A" 



E E^ s) («)e{4 9) *(*)IQ(*)} 



gefe fc=l 



(44) 



g eg K fc=i 



Now note that the values are structurally similar to the Ajj, 
values, and hence (similar to d42i >) we have (using Qi(t)k = 
Qi{t) and l tot = I): 



K 



N 



g&QK fe=l 



(<?) 



TO — 1 , 



i=l 



+Qtot(t)i\g K \ 



N 

TO — 1 

TV — 1 



Using this in d44i > yields: 



E E^ s) (*)E{Mi ff> wiQw} 

Z(m - 1) 



g eg K k=i 
-jQtot(t) \Qk 



1 - 



> nQtot(t) \Qk\ 



1 

N 



TO — 1 
f 

N - 1 
(m — 1) 



N — 1 

TO — 1 



N(N-l) AT-1 
= riQtot(t)|eA-|/X 



(45) 



(46) 



where the last equality holds by d43b . The above holds for 
= Z S {1, . . . , A}. Suppose now that I > K. In this case 
we have n > r^, proving the result of Lemma [8] for / > K. 

Consider now the final case where I 6 {1, . . . , K — 1}. Then 
from (l45T l we have: 

E f:Qi 9 \m{$>(t)\Q(t)} 



g eg K fe=i 



(47) 



Using the fact that ^ > ^fj- > ^ yields the result. 



Appendix D — Proof of LemmaQ] 

Here we prove that [i^ m > figT v Specifically, we show 
that if p is a value such that < p < 1, then for any positive 
integer A' we have: 

1[1 - (1 - p)*] > -^[1 - (1 - (48) 

To show this, note that it is trivially true for the case p = 1. 
In the opposite case where < p < 1, we can multiply (|481 ) 
by A"(A+ 1) and rearrange terms to see that the inequality is 
equivalent to the following: 



(1 -p) K +Kp(l-p) K < 1 



(49) 



Thus, it suffices to prove that d49l is true. To this end, we 
have: 



(I - p) K + Kp{\ - p) K < (I - p) K + Kp{\ - p) 

K ' K 



K-l 



< E 7 W-p) K - 



= (d-p)+p) K = i 

where the first (strict) inequality holds because < p < 1 
and hence (1 — p) K < (1 — p) K ~ 1 . This establishes d49l and 
completes the proof of Lemma Q] 

Appendix E — Proof of Lemma[To1 

The queueing dynamics are given by Qi{t + 1) = Qi(t) — 
fii(t) + Ai(t), where jii(t) = min[fii(t), Qi (t)}. Using the 
Lyapunov function L(Q(i))4| YliLi Qi(t) 2 an d performing 
a standard quadratic drift computation (see, for example, [6]), 
it is not difficult to show the drift satisfies: 

N 



A(Q(*)) = JEM^W 2 } 



JV 



i=i 

N 



N 



E w*) - E ^(*) E 1 



i=l 
N 



■^Qi(t)E{tH(t)-jH(t)\Q(t)} 



By definition of /ij(i), we have: 

Qi(t)(ni(i) - fk(t)) = jli(t)fii(t) -Ai(t) 2 

< min^i^a^/ii^),^^) 2 ] - p.i(t) 2 

Hence: 

AT 



A(Q(t)) < J]Te{^) 2 } 



A' 



i=l 
N 



N 



E - E ^( f ) E (wW 1 

1=1 i=i 

^Ejmiri [/i vmQ:c ^W,^W 2 ] I Q(*)}(50) 
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Using the fact that E{min[-, ■]} < min[E {•} , E {•}] (by 
Jensen's inequality and concavity of the min[-, •] operator), 
that the sum of a min is less than or equal to the min of a 
sum, and that J2iLi E W^) 2 I Q(*)} < S 2 , the final term on 
the right hand side of (f50b can be bounded by: 



N 
,i=l 



c E{p t (t)\ Q(i)},5 2 



Because the max-weight policy maximizes Ei=i Qi(t)fM(t) 
(given queue backlogs Q(t)) we have: 



N N 

J2Q^(m{Mt)\Q(t)}>J2 , 

i=l »=1 



i(t)EK(t)|g(t)} 



(51) 

where /U*(t) represents any alternative scheduling decision. 
Noting that X/p G A and fj, m /N G A, we have by convexity 
of A: 

A + (1 - p)» sy jN G A 

Thus, there exists a stationary randomized policy that chooses 
fi* (t) independent of queue backlog to yield: 

E { M *(f) | Q(t)} = E {n* (i)} = A + (1 — pK^/W 

Plugging this into (l5TT l and then into < f5Qb yields: 

JV 



A(Q(i)) < J|>{^) 2 } 



(i - g)/^ 



j=i 



Qi(t) 



' TV 

E< 

,1=1 



c E{/j^) | Q(t)},5 2 



(52) 



Using the Lyapunov drift lemma (Lemma on the above 
drift and noting that the system is stable with well defined 
time average limits yields: 



N 

E 

i=i 



N 



Qi 



< 



I Ei=i E {^f } - 1 Ei=i x l 



N min 



+ - 



(1 - pK: 
2^i=i A iMi 



o2 

C7 J 



(1 - P)fi sym 

where we have used the fact that lirut—voo E {pi(t)} — A, and 
lim^ocE {$(*)} > lim^E^t)} 2 = Af. Using Little's 
theorem on this congestion bound proves (l38l ). 

Now suppose that all 5i(t) processes are independent and 
Pr[Si(t) = fii.max] > Pmin f° r all £• We derive the bound on 
P-sym given in Lemma [TO] Define pA min ie { li ... j /v} Mi.maz- 
Consider the stationary and randomized algorithm fi*(t) that 
observes channel states S(t) and probabilistically places each 
link i e {1, . . . , N} in a set x(*) with probability if S'j(t) < 
/i, and with probability p m in/ 'Pr[Si(t) > p] if 5,(4) > p. 
Then x(4) contains a random number of links, and each link 
appears in x(t) independently with probability p m in- Select 



a link to serve on slot t uniformly and randomly with equal 
probability over all links in (remaining idle if K 
empty). It follows that under this policy, a particular link i is 
selected for transmission with probability exactly (1 — (1 — 



Pn 



)/N, and is selected only if Si(t) > p. Hence: 



EKWI> A(1 (1 N Pmm)N) foralHG{l,...,7V} 

It follows that the symmetric rate vector with all A^ entries 
equal to the right hand side in the above expression is in the 
capacity region A, so that p S ym/N is greater than or equal to 
this value. 
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